# Efficient text generation
Deepseek R1 0528 GGUF
MIT
A quantized model based on DeepSeek-R1-0528, focusing on text generation tasks and providing a more efficient way of use.
Large Language Model
D
lmstudio-community
1,426
5
Ytu Ce Cosmos.turkish Gemma 9b V0.1 GGUF
A large language model based on the Gemma architecture, specialized in Turkish text generation tasks.
Large Language Model
Y
DevQuasar
247
1
Huihui Ai.qwen3 4B Abliterated GGUF
The quantized version of Huihui AI's Qwen3-4B model, aiming to make knowledge more widely accessible to the public.
Large Language Model
H
DevQuasar
540
1
Qwen3 1.7B 4bit
Apache-2.0
Qwen3-1.7B-4bit is a 4-bit quantized version of the Tongyi Qianwen 1.7B model, which has been converted to the MLX framework format for efficient operation on Apple Silicon devices.
Large Language Model
Q
mlx-community
11.85k
2
Bamba 9B V2
Apache-2.0
Bamba-9B-v2 is a decoder-only language model built on the Mamba-2 architecture, focusing on text generation tasks and outperforming Llama 3.1 8B in performance.
Large Language Model
Transformers

B
ibm-ai-platform
3,634
15
Tesslate Tessa Rust T1 7B GGUF
Apache-2.0
A quantized version of Tessa - Rust - T1 - 7B, quantized using the llama.cpp tool, supporting efficient operation under different hardware conditions.
Large Language Model
Transformers English

T
bartowski
542
2
Qwen2.5 0.5B Instruct Gensyn Swarm Feathered Giant Ostrich
A fine-tuned model based on the Transformer architecture, which performs excellently in question-answering and text generation tasks, providing an accurate and efficient language interaction experience.
Large Language Model
Transformers

Q
chinna6
2,027
1
Gemma 3 4b It Abliterated GGUF
MIT
An innovative quantization solution that achieves smaller model size while maintaining high performance through mixed-precision quantization.
Large Language Model English
G
ZeroWw
247
4
Qwq 32B Gptqmodel 4bit Vortex V1
Apache-2.0
QwQ-32B is a 32B-parameter large language model based on the Qwen2 architecture, processed with 4-bit integer quantization using the GPTQ method, suitable for efficient text generation tasks.
Large Language Model
Safetensors English
Q
ModelCloud
1,620
11
L3.3 Electra R1 70b GGUF
This project provides multiple GGUF matrix quantization versions of the Steelskull/L3.3-Electra-R1-70b model, effectively improving the model's operating efficiency and performance in different hardware environments.
Large Language Model
Transformers

L
ddh0
2,376
6
Llama 3.1 0x Mini
0x Mini is a lightweight language model developed by Ozone AI, optimized based on the Llama-3.1 architecture, providing efficient text generation capabilities
Large Language Model
Transformers

L
ozone-research
21
5
Fikri 3.1 8B Instruct
Fikri is a language model specifically tailored for Turkish language tasks, based on the Llama 3.1 architecture with 8 billion parameters.
Large Language Model
Safetensors
F
BrewInteractive
3,226
6
Mistral
Other
Mistral 7B is a large language model launched by Mistral AI with 7 billion parameters. It is designed for high efficiency and performance and is suitable for real-time application scenarios that require quick responses.
Large Language Model
M
cortexso
1,768
1
Llama 3 Smaug 8B GGUF
GGUF format quantized model based on abacusai/Llama-3-Smaug-8B, supporting 2-8 bit quantization levels, suitable for text generation tasks
Large Language Model
L
MaziyarPanahi
8,904
5
GIGABATEMAN 7B GGUF
GIGABATEMAN-7B is a 7B-parameter large language model based on the Mistral architecture, focusing on text generation tasks.
Large Language Model English
G
mradermacher
115
3
Mamba 370m Hf
Mamba is an efficient language model based on the State Space Model (SSM), with the ability to model sequences with linear time complexity.
Large Language Model
Transformers

M
state-spaces
6,895
14
Featured Recommended AI Models